Siamese Network for RGB-D Salient Object Detection and Beyond

Abstract

Existing RGB-D salient object detection (SOD) models usually treat RGB and depth as independent information and design separate networks for feature extraction from each. Such schemes can easily be constrained by a limited amount of training data or over-reliance on an elaborately designed training process. Inspired by the observation that RGB and depth modalities actually present certain commonality in distinguishing salient objects, a novel joint learning and densely cooperative fusion (JL-DCF) architecture is designed to learn from both RGB and depth inputs through a shared network backbone, known as the Siamese architecture. In this paper, we propose two effective components: joint learning (JL) and densely cooperative fusion (DCF). The JL module provides robust saliency feature learning by exploiting cross-modal commonality via a Siamese network, while the DCF module is introduced for complementary feature discovery. Comprehensive experiments using 5 popular metrics show that the designed framework yields a robust RGB-D saliency detector with good generalization. As a result, JL-DCF significantly advances the SOTAs by an average of ~2.0% (F-measure) across 7 challenging datasets. In addition, we show that JL-DCF is readily applicable to other related multi-modal detection tasks, including RGB-T SOD and video SOD, achieving comparable or better performance.
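
The shared-backbone idea described in the abstract can be illustrated with a minimal NumPy sketch. Everything in it is assumed for illustration only: the toy one-layer `shared_backbone`, the weight shape, the replication of the depth map to three channels, and the element-wise fusion are hypothetical stand-ins, not the JL-DCF implementation.

```python
import numpy as np

def shared_backbone(batch, weights):
    """Toy 'backbone': a per-pixel linear map (1x1 convolution) followed by ReLU.

    batch:   array of shape (N, H, W, C_in)
    weights: array of shape (C_in, C_out), the single parameter set
             shared by both modalities
    """
    return np.maximum(batch @ weights, 0.0)

def siamese_forward(rgb, depth, weights):
    """Pass RGB and depth through the same weights in one batched call."""
    # Assumption: replicate the 1-channel depth map to 3 channels so it
    # matches the RGB input shape and can reuse the same weights.
    depth3 = np.repeat(depth, 3, axis=-1)
    # Stack both modalities along the batch axis: one pass, one weight set.
    joint = np.concatenate([rgb, depth3], axis=0)
    feats = shared_backbone(joint, weights)
    n = rgb.shape[0]
    rgb_feat, depth_feat = feats[:n], feats[n:]
    # Placeholder fusion (sum plus product); a hypothetical stand-in
    # for the densely cooperative fusion module.
    fused = rgb_feat + depth_feat + rgb_feat * depth_feat
    return rgb_feat, depth_feat, fused

rng = np.random.default_rng(0)
rgb = rng.random((2, 8, 8, 3))        # two RGB images
depth = rng.random((2, 8, 8, 1))      # their depth maps
weights = rng.standard_normal((3, 4)) # shared parameters
rgb_feat, depth_feat, fused = siamese_forward(rgb, depth, weights)
print(rgb_feat.shape, depth_feat.shape, fused.shape)
```

Because both modalities go through the same `weights`, any gradient update in a real training loop would be driven by RGB and depth samples together, which is the sense in which a Siamese design exploits cross-modal commonality.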

Related resources

Local Background Enclosure for RGB-D Salient Object Detection - Supplementary Results

The purpose of this supplementary material is to examine in detail the contributions of our proposed Local Background Enclosure (LBE) feature. A comparison of LBE with the contrast based depth features used in state-of-the-art salient object detection systems is presented. The LBE feature is compared with the raw depth features ACSD [1], DC [3] and a signed version of DC denoted SDC on the RGBD...

RGB-D Salient Object Detection Based on Discriminative Cross-modal Transfer Learning

In this work, we propose to utilize Convolutional Neural Networks (CNNs) to boost the performance of depth-induced salient object detection by capturing the high-level representative features for depth modality. We formulate the depth-induced saliency detection as a CNN-based cross-modal transfer problem to bridge the gap between the "data-hungry" nature of CNNs and the unavailability of suff...

Learning Graph Matching for Object Detection from RGB-D Images

We propose an optimization method for estimating parameters in graph-theoretical formulations of the matching problem for object detection. Unlike several methods which optimize parameters for graph matching in a way to promote correct correspondences and to restrict wrong ones, our approach aims at improving performance in the more general task of object detection. In our formulation, similari...

Frustum PointNets for 3D Object Detection from RGB-D Data

While object recognition on 2D images is getting more and more mature, 3D understanding is eagerly in demand yet largely underexplored. In this paper, we study the 3D object detection problem from RGB-D data captured by depth sensors in both indoor and outdoor environments. Different from previous deep learning methods that work on 2D RGB-D images or 3D voxels, which often obscure natural 3D pa...

Semantic Parsing for Priming Object Detection in RGB-D Scenes

The advancements in robot autonomy and capabilities for carrying out more complex tasks in unstructured indoors environments can be greatly enhanced by endowing existing environment models with semantic information. In this paper we describe an approach for semantic parsing of indoors environments into semantic categories of Ground, Structure, Furniture and Props. Instead of striving to categor...

Journal

Journal title: IEEE Transactions on Pattern Analysis and Machine Intelligence

Year: 2021

ISSN: 1939-3539, 2160-9292, 0162-8828

DOI: https://doi.org/10.1109/tpami.2021.3073689